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Abstract. This paper is dealing with DNA cyclic codes which play 
an important role in DNA computing and have attracted a particular 
attention in the literature. 

Firstly, we introduce a new family of DNA cyclic codes over the ring R = 
F 2 [u]/(u®). Such codes have theoretical advantages as well as several 
applications in DNA computing. A direct link between the elements 
of such a ring and the 64 codons used in the amino acids of the living 
organisms is established. Such a correspondence allows us to extend the 
notion of the edit distance to the ring R which is useful for the correction 
of the insertion, deletion and substitution errors. Next, we define the 
Lee weight, the Gray map over the ring R as well as the binary image 
of the cyclic DNA codes allowing the transfer of studying DNA codes 
into studying binary codes. Secondly, we introduce another new family 
of DNA skew cyclic codes constructed over the ring i? = F 2 + UF 2 = 
{0, l,v,v + 1} where = v and study their property of being reverse- 
complement. We show that the obtained code is derived from the cyclic 
reverse-complement code over the ring R. We shall provide the binary 
images and present some explicit examples of such codes. 


1. Introduction 

Deoxyribonucleic acid (DNA) contains the genetic program for the bio¬ 
logical development of life. DNA is formed by strands linked together and 
twisted in the shape of a double helix. Each strand is a sequence of four 
possible nucleotides, two purines; adenine (A), guanine (G) and two pyrim¬ 
idines; thymine (T) and cytosine (C). The ends of the DNA strand are 
chemically polar with 5' and 3' ends. Hybridization, known as base pairing, 
occurs when a strand binds to another strand, forming a double strand of 
DNA. The strands are linked following the Watson-Crick model; every (A) 
is linked with a (T), and every (C) with a (G), and vice versa. We denote by 
X the complement of x defined as follows, A = T,T = A,G = C and C = G 
(for instance if x = (AGGTAG), then its complement x = [TGGATG). 
DNA computing combines genetic data analysis with science of the compu¬ 
tation in order to tackle computationally difficult problems. This new field 

started by Leonard Adleman [3], who solved a hard computational problem 
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by DNA molecules in a test tube. Several authors have contributed to pro¬ 
vide constructions of cyclic DNA codes over fixed rings. In [2|I14| . the authors 
gave DNA cyclic codes over finite field with four elements. Further, Siap et 
al. have studied in m cyclic DNA codes over the ring ¥ 2 [u\/{u‘^ — 1) using 
the deletion distance. More recently, Guenda et al. have studied in [8] cyclic 
DNA codes of arbitrary length over the ring ¥ 2 [u\/{u'^ — 1). Those codes 
have several applications as well as high hybridization energy. In this paper 
we firstly consider the DNA codes of length n over the ring R = F 2 [u]/(n®). 
The ring i? is a principal commutative ring with 64 elements. With four 
possible bases, the three nucleotides can give 4^ = 64 different possibilities, 
called codons. These combinations are used to specify the 20 different amino 
acids used in the living organisms [3]. To this end, we construct a one-to- 
one correspondence between the elements of R and the 64 codons over the 
alphabet {A,G,C,T}^. Such a correspondence allows us to extend the no¬ 
tion of the edit distance to the ring R. The edit distance is an important 
combinatorial notion for the DNA strands. It can be used for the correction 
of the insertion, deletion and substitution errors between the codewords. 
This it is not the case for the Hamming, deletion, and the additive stem 
distances. For that in this paper, we design cyclic reverse-complement DNA 
codes over the ring R with designed edit distance D. We also give some 
upper and lower bounds on D. We define a Lee weight and a Gray map 
over R. The images of our DNA codes under the mapping are quasi-cyclic 
codes of index 6 and of length 6n over the alphabet {A, G,C,T}. There are 
several advantages in using codes over the ring R. We list some of them 
below: 

(1) There exists a one-to-one correspondence between the codons and 
the elements of the ring R. 

(2) A code over R can contains more codewords than codes of the same 
length over fields. 

(3) The factorization of x” — 1 is the same over the field F 2 but is not 
the same over other rings. This fact simplifies the construction of 
cyclic codes over R. 

(4) The structure of the cyclic codes of any length over R is well-known [9] , 
whereas little is know concerning the structure of cyclic codes of any 
length over rings. 

(5) The cyclic character of the DNA strands is desired because the ge¬ 
netic code should represent an equilibrium status m- Another ad¬ 
vantage of cyclic codes, as indicated by Milenkovic and Kashyap [12j . 
is that the complexity of the dynamic programming algorithm for 
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testing DNA codes for secondary structure will be less for cyclic 
codes. 

(6) The binary image of the cyclic codes over R under our Gray map 
are linear quasi-cyclic codes. 

In the second part of the paper, we study the skew cyclic DNA codes over 
the ring i? = F 2 + UF 2 = {0,1, u, u + 1}, where = v. The codes obtained 
satisfy the reverse-complement. Further we give the binary images of the 
skew cyclic DNA codes and provide some examples. The advantage of study¬ 
ing the reversible DNA code in skew polynomial rings is to exhibit several 
factorizations. Therefore, many reverse-complement DNA code could be 
obtained in a skew polynomial ring (which is not the case in a commutative 
ring). 

This paper is organized as follows. In Section 2 we start by presenting 
some preliminaries results as well as the one-to-one correspondence between 
the element of the ring R = F 2 [u]/(u®) and the codons. Next, we give the 
algebraic structure of the cyclic codes over R = F 2 [u]/(u®) and we study 
the DNA cyclic codes and reverse-complement of these codes. Moreover, we 
define the Lee weight related to such codes and give the binary image of 
the cyclic DNA code. Some explicit examples of such codes are presented. 
In Section 3, we describe a skew cyclic DNA codes over i? = F 2 -|-uF 2 = 
{0, l,u,u -|- 1} where = v. We study their property of being reverse- 
complement and provide explicit examples of such codes with minimum 
Hamming distance. 

2. Cyclic DNA Codes over R = F2[u]/(u®) 

2.1. Some Background and Preliminaries. The ring considered in this 
section is 

R = F2[u]/(u®) = {ao -|- aiu + a2U^ + + a^u^]ai G F 2 ,u® = 0}. 

It is a commutative ring with 64 elements. It is a principal local ideal ring 
with maximal ideal (u). The ideals of R satisfy the following inclusions 

(0) = (u®) C (u®) C (m^) C c (u^) C (u) C {R). 

Since the ring R is of the cardinality 64, then we can construct a one- 
to-one correspondence between the elements of R and the 64 codons over 
the alphabet {A, G, C, T}^ by the map (j), this is given in Table 1. A simple 
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verification give that for all x € -R, we have 

(1) X + X = vP + + u + 1. 

Now, since BP is an R-module, a linear code over R of length n is a sub- 
Table 1. Identifying Codons with the Elements of the Ring R. 


ccc 

+ R + 1 

GGG 

0 

ACT 

+ 1 

GTG 

it^ + it^ + n + 1 

GGA 

+ u'^ + u 

GGT 

1 

ACG 

+ n 

AGA 

+ u^ + u + 1 

GGC 

+ u* + + 1 

CGG 

u 

TTT 

+ 1 

GAG 

+ u'^ + 1 

GGT 


CCA 

u + 1 

TTG 

u 

AGG 

+ u + 1 

AGG 

+ u + 1 

TCC 


CTA 

+ li + 1 

GAT 

+u^ + 

CGG 

+U*‘ +u'^ +U+1 

GCC 


GTT 

«'* + + 1 

GTA 

u 1 

GAG 

u 1 

CTC 


GTG 

u 

ATT 

u* + + 1 

AGA 

u 

TCT 

+ 1 

TCA 

+ It + 1 

ATA 

It^ + + -lA^ + U 

AGO 

+ 1 

TCG 

-\- u 

CAA 

u 

ATC 


ATG 

u‘^ u 1 

TAC 


CAC 

u 

TGA 

It^ + It^ + it 

AGT 

+ u*‘ + 

TAT 

li® +1 

GCA 

It^ + It + 1 

AAT 

+ u'^ + u + 1 

GGA 

+ li 

GCT 

+ 1 

TTA 

'u'* + 

AAA 

+ u 

CGG 

+u*‘ +u^ + l 

GCG 

-\- u 

ACC 

3 1 2 

It + It 

TGC 

u® + + 1 

GGT 

«^ + ti* + 

TAA 

u 

CAT 

5 1 2 

It + It 

AAC 

+ 1 

TGG 

u‘^ u 1 

CTG 

-\- u 

TGT 

+ u"* 

TCC 

u* + -u^ 

GAA 

-\- u 

CTT 

u* + 1 

CAG 


TAG 

It^ + It + 1 


module C of R^. An (n,/c) linear block code of dimensions n = ml, is 
called quasi-cyclic if every cyclic shift of a codeword by I symbol yields 
another codeword. Define the Hamming weight of the codeword x G C as 
wh{x) = Uaiix), where a, G R*. The Hamming distance dH{x,y) between 
the vector x and y is wh{x — y). Let x = xqXi ... x„_i be a vector in R^. 
The reverse of x is defined as x^ = Xn-iXn -2 ■ ■ - XiXq, the complement of x 
is x^ = xqXi ... Xn-i, and the reverse-complement, also called the Watson- 
Crick complement (WCC) is defined as x^'^ = Xn-iXn -2 • • • xiXq. A code C 
is said to be reversible if for any x G C we have x*” G C. C is said to be 
reverse-complement if for any x G C we have x^*^ G C. 


We shall use the edit distance for correction of the insertion, deletion 
and substitution errors in codewords. It is the minimum number of the 
operations (insertion, substitution and deletion) required to transform one 
string into the other. The edit distance can be defined as follows m- 
Let A and B be finite sets of distinct symbols and let x* G denotes 
an arbitrary string of length t over A. Then xj denotes the substring of 
X* that begins at position i and ends at position j. The edit distance is 
characterized by a triple {A, B, c) consisting of the finite sets A and B, and 
the primitive function c: E ^ R_|_, where M+ is the set of nonnegative reals, 
E = Eg U E^ D Ei is the set of primitive edit operations, Eg = A * B is 
the set of substitutions, Ed = A* E is the set of deletion and Ei = E x B 
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is the set of insertions. Each triple {A, B, c) indnces a distance fnnction 
dc ■ A* X B* ^ M+ which maps a string x* to a nonnegative value, defined 
as follows. 


Definition 2.1. The edit distance dc{x^,y^) between two strings x* € A^ 
and y'" G B"^ is defined recursively as 


dc{x^, y^) = min < 


c{xt,yv) +dc{x^~^,y^~^); 
c{xt,e) + dc{x^-^,y^); 
c(e,y„) + dc{x\y^-^)-, 


where dfie, e) = 0 if e denotes the empty string of length n. 


It is easy to check the following bounds on the edit distance dc- 


Proposition 2.2. Assume that X and Y are strings in R^. Then the 
following holds: 

(i) dc{(j)iX),(j){Y)) < n; 

(ii) dfi<P{X),<p{Y)) <dH{<P{Y),<P{X)y, 

(hi) dfi<P{X),<p{Y)) = dfi<P{Y),<p{X)). 

2.2. Cyclic codes over R = F 2 [R]/(n®). In this subsection we give the 
algebraic structure of the cyclic code of arbitrary length over the ring R. 
We start by giving the definition of cyclic code over this ring. 

Let C be a code over R of the length n. A codeword C = (cq, ci, • • •, c„_i) of 
C is viewed as a polynomial cq + cix + • • • + Cn-ix”“^ G R[x]. Let r be the 
cyclic shift acting on the codewords of C in the following way: 

l(co, Cl, • • •, Cjj_i) — (Cjj_i, Cq, Cl, •, •, •, Cn—2 ) • 

Recall that linear code C is cyclic if C is invariant under permutation r : 
c(x) i-A xc(x) modulo {x^ — 1). 

The following result gives the structure of the cyclic code of arbitrary 
length. 

Theorem 2.3. ( lE,Wi) Let C be a cyclic code of arbitrary length n over the 
ring R. 

(i) If n is odd. Then there exist six polynomials /o, /i, / 2 , /s, / 4 , /s such that 
/ 5 |/ 4 |/ 3 |/ 2 |/i|/o|a;''- 1 andC = (/o, m/i,M^/ 2 ,w^/ 4 , 

(m) If n = m2® such that gcd{m,p) = 1. Then the cyclic codes of length 
n = m2® over R are the ideals generated by (/o, i^/i, 11 ^/ 2 , 11 ^/ 3 , n^/ 4 , 1 x^/ 5 ), 
where fi\fo and fo is divisor of x^ — 1 in ¥ 2 . 

Let denote by K the field R/{u). We have the following canonical ring 
morphism 
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- : R[x] K[x]; f ^ f = f (mod u) 
The rank of C is defined as 


5 

k{C) = Y,k 

i=0 


2 ? 


where the ki are such that ICI = The submodule quotient of 

C by u G i? is the code 


(C : u*) = {u G R^lu'v G C}. 


Thus we have the following tower of linear codes over R. 

(2) C = {C :u) C{C :u‘^) C{C : u^) C (C : u^) C (C : u^) 

For f = 1, • • • , 5 the projections of {C : tt*) over the field K are denoted by 
Tori{C) = {C : u^) and are called the torsion codes associated to the code C. 
The following theorem presents some bounds on the edit distance for the 
cyclic codes defined above. 


Theorem 2.4. LetC = {fo,ufi,u‘^f 2 ,u^f 3 ,u^f 4 ,u^fb) be a cyclic code over 
R of odd length. Then the minimum edit distance dc of C satisfies the fol¬ 
lowing inequalities. 

(i) dc(C) = min{dc(Tori(C))} < min{dH(Tori(C))}; 

(a) dc(C) < min{deg(fi)} + 1; 

(Hi) dc{C) < n — rank{C) + 1. 


Proof. Prom [71 Lemma 5.1] and Proposition 12.21 we have that dc{C) = 
min{dc{Tori{C))} < min{dH{Tori{C))}. Assertion (m) comes from the fact 
that the code Tori{C) and Toro{C) are binary cyclic codes, and satisfy 
(fi) C C. The dimension of Tori{C) is n — deg{fi). By the well-known Single- 
ton bound, we have dc{C) < min{deg{fi)} + 1. Assertion {Hi) follows from 
Proposition 12.21 using the Singleton bound. □ 


2.3. DNA Cyclic Codes. Now, we firstly introduce a DNA cyclic code, 
more precisely, the construction of a [3n, dj-DNA cyclic code. Set I(x) := 
(x” — l)/(x — 1) and a{u) -.= u 1. 

Definition 2.5. Let 1 < D < Hn — 1 he a positive real number. Then a 
cyclic code C of the length n over R is called an [n,D]-cyclic DNA code if 
the following conditions hold: 

(i) C is cyclic code, i.e., C is an ideal in Rn = R[x\/{x'^ — 1); 

(ii) for any codeword x € C, we have {x)^‘^ ^ (x) and {x)'^‘^ G C; 

(hi) dc{x,y) < D for any x,y € C. 
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Condition (ii) given in Definition 12.51 shows that the defined DNA cyclic 
codes are reverse-complement cyclic codes. 

Definition 2.6. Let f{x) G R[x\, denote f{x)* = the reciprocal 

polynomial of f{x). The polynomial f is said to be self-reciprocal if f{x) = 

Lemma 2.7. ( fWf) Let f{x) and g{x) be a polynomials in with 

deg{f{x)) > deg{g{x)). Then the following conditions hold: 

(i) [f{x)g{x)]* = fix)*g{x)*-, 

(ii) [/(x) + g{x)]* = f{x)* -|- 

Theorem 2.8. Let C be a cyclic code of odd length n over R, and that C is 
reverse-complement. Then we have: 

(i) C contains all the codewords of the form a(tt)I(x). 

(ii) C = {fo,ufi,u^f 2 ,u^f 3 ,u'^f 4 ,u^f^); such that all fi self-reciprocal. 

Proof. 

(i) Assume C = {fo,ufi,u‘^f 2 ,u^f 3 ,u'^f 4 ,u^fs) is a cyclic code of odd 
length n over R. Since the zero codeword is an element in C then its 
WCC is also a codeword of C, i.e.; 

(0, (), • • ■ , 0) = («(«), a{u), ■ ■ ■ , a{u)) 

= «(«)(!, 1, • • • , 1) = a{u){x'^ — l)/(x — 1) G C. 

(ii) In the second part we show that f*{x) = fi{x) for 0 < z < 5. 

Let /o(x) = ao + aix + • • • + + amx'^ where /o/(x” — 1) 

in F 2 [x]. One can assume that ao = Um = 1- So that /o(x) = 1-|- 
ui + • • • + + x™. Suppose that /o(x) corresponds to the 

vector (1, oi, • • ■ , 1,0,0 • • • ,0) and the reverse-complement of 0 in ii 

is a{u) then fo‘^{x) = q;(u)(1-|-x-| -|-x"'“"*“^)-|-(Q;(u)-t-l)x"'“”^“^-|- 

am-ix"'“"* -I- • • • -f oix””^ -|- {a{u) + l)x"'“^) G C. Now, since C is a 
linear code, we get 

/o(x)- + a(u)(^^)GC. 

This implies that 

^ ^ aiu))x^-^ + • • • + (ai + a(u))x"-2 + 

= + (a,_i + a{u))x + • • • + (ai + a{u))x'^-^ + x™] G C. 

Multiplying the last polynomial by x™^^ in R[x]/{x^ — 1), we 
obtain: 



NABIL BENNENNI, KENZA GUENDA AND SIHEM MESNAGER 


1 “1“ iflm—1 Old'll)) + ■ ■ ■ + (fll + Oi{u))x^ ^ + X™ G C. 

By Equation ( H]) we see that a + a{u) = a. Therefore, we obtain; 

/o(x) = 1 + am-ix H-h + x'” € C. 

Consequently, we have 

foix) = foko + ufiki ^ -h u^f5k5, 

where fi and ki are all in F 2 [x]. Multiplying both sides of this 
equality by gives 

u^foix) = y^ko{x)fo{x). 

Now, since /q(x),/o(x) € F 2 [x] have the same degree, leading co¬ 
efficient and constant term, one necessary have kQ{x) = 1. Conse¬ 
quently, /o(x) is self-reciprocal. The same argument can be used for 
fij 2 ,h, h and /s as well. 

□ 

In the following, we are interested in providing sufficient conditions for a 
given code C to be reverse-complement. 

Theorem 2.9. Assume that C = {fo,ufi,u‘^ f 2 ,u^ f 3 ,u^f 4 ,u^ f 5 ) is a cyclic 
code of odd length n over R with /5|/4|/3|/2|/i|/o|3;”'—1 G F 2 [x]. //Q;(n)I(x) G 
C and fi{x) are self-reciprocal, then C is a reverse-complement code. 

Proof. Let c(x) be a codewords in C, we have to prove that c(x)'’'^ G 
C. Since C = {fo,ufi,u'^f 2 ,u^f 3 ,u^fi,u^f^) there exist ai(x) G R[x] {i G 
{1, • • • , 5}) such that 

c(x} = foao ufiai + f 2 a 2 + + u^f^oti -b n^/sos- 

Applying the reciprocal and using hrst Lemma 13.71 and next the fact that 
fo{x), /i(x), / 2 (x), fsix), / 4 (x) and / 5 (x) are self-reciprocal, we obtain c*(x) = 

{foaoT + (n/iai)*X-l + + (n3/3«3)*X™3 + (nV4«4)*X™4 + 

{u^f^a^)*x"^^, proving that c(x)* is in C. Since C is cyclic, x”'“*“^c(x) = 
cox"’“*“^ -b cixn — t -b • • • -b ctx”“^ G C. It was also assumed that a{u) -b 
a{u)x -b ■ ■ ■ -b a(tt)x"'“^ G C, which leads to 

a{u) -b a{u)x -b • • • -b a{u)x'^~^ -b cox"'“*“^ -b cixn — t -b • • • -b ctx'^~^ G C. 

This is equal to a{u) + a{u)x -b • • • -b a(u)x^~^~‘^ + {a{u) + co)x"'“*“^ H-b 

(a(ti)-bct)x"'“^ = a{u)-\-a{u)x -ba(tt)x"'“*“^-|--bCix"""^, 

which is precisely (c*(x)'’‘^)* = c(x)'’'^ G C. □ 
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Theorem 2.10. Assume thatC = {fo,ufi,u‘^f 2 ,u^f 3 ,u‘^f 4 ,u^f 5 ) be a cyclic 
code of even length n = m2^ over R such that /i|/o in F 2 [x]. If a{u)I{x) G 
C and fi{x) (i G {I,-- - ,5}j are self-reciprocal. Then C is a reverse- 
complement code. 

Proof. The proof can be done in a similar manner than the one of 
Theorem 12.91 □ 

Corollary 2.11. Let C be a cyclic code of length n = m2®, s > 0. If 
a{u)I{x) G C and there exists an integer i such that 

(3) 2* = -l[m]. 

Then C is a reverse-eomplement code. 

Proof. The proof is similar to the proof of Corollary 4.14 in [8]. □ 


Definition 2.12. For a cyclie code C = {fo,ufi,u^f 2 ,u^fsTu'^f 4 ,,u^f^), we 
define the sub-code Cy 2 consisting of all codewords in C that are a multiple 
of u^. 

Lemma 2.13. With the previous notations we have: 

(i) (p{u^R) = {GGG, AGT, CGT, TGT, GAT, GAG, TAG, ATG, GTG, TGG, GTG, AGG, 
CTC, TGG, GGT, GAT}, 

(fiu^R) = {GGG, TGT, GAG, TAG, GTG, GGG, AGT, TTA} and 
4>{u^R) = {GGG, TAG, CTC, TGT}. 

(ii) If C = {fo,ufi,u‘^f 2 ,u^f 3 ,u'^f 4 ,u^f 5 ) is the cyclic code of length n 
over R. Then Cy 2 = {u^ f^) over the alphabet (f{u^R). 

Proof. The part (i) is obtained by a simple calculation. 

For the part (ii), assume that C = {fo,ufi,u‘^f 2 ,u^f 3 ,u^fi,u^fd). Since 
/ 5 |/ 4 |/ 3 |/ 2 |/i|/o|a;” - 1, then we obtain C C,, 2 . Conversely, assume 

that c{x) G C such that c{x) = ao{x)fo{x) + uai{x)fi{x) + u'^a 2 {x)f 2 {x) + 
u^a 3 (x)/ 3 (x) + u '^/4 + for all a* G F 2 [®]. If c(x) is a multiple of then 
x"’ — 1 divides ao(x)fo(x) and x” — 1 divides ai(x)/i(x). Hence, c(x) = 
u^a 2 (x)f 2 (x) + u^a 3 (x)f 3 (x) + tt^Q; 4 (x)/ 4 (x) + u^a 5 (x)f 5 (x). Therefore, 

Cu 2 C {u'^fb). Consequently Cy _2 = {u^ f^), which completes the proof. □ 

Remark 2.14. The cyclic DNA codes which are obtained in the Lemma \2.13\ 
are stable across the error in the DNA strands by the usage of the codons, 
see l^. 

Any codeword of sub-code Cy 2 = {u^ f^) over R) contains the nucleotide 
C and G. This is an interesting thermodynamic property of the DNA strand. 

For its importance please see If. 
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Example 2.15. We have thatx^—1 = (x—l)(x^+x+l)(x^+x^+l) = / 0 / 1/2 
mF 2 [x]. The Table 2 represents the cyclic DNA code with minimal Hamming 
distance. 

Table 2. Cyclic DNA Codes with Minimal Hamming Distance 


Number of the Code 

Generators 

Type of the Code 

1 

< u'^fo > 

(7,4096,2) 

2 

< U^fl > 

(7,256,3) 

3 

< vff2 > 

(7,256,3) 

4 

< W^/ 1/2 > 

(7,4,7) 

5 

< > 

(7,64,4) 

6 

< w^/ 0/2 > 

(7,64,4) 


Table 3. A DNA Code with Minimal Edit Distance Ob¬ 
tained from C = (w^/ 0 / 1 ) of the Type (7,64,6) 


GGGGGGGGGGGGGGGGGGGGG 

CTCGGGCTCGTGGTGGGGGGG 

GGGGTGGGGGTGTGTTGTTGT 

TGTGGGCTGGGGGTGTGTTGT 

TGTTGTGGGGTGGGGGTGTGT 

TGTTGTTGTGGGGTGGGGGTG 

CTCTGTTGTTGTGGGGTGGGG 

GGGGTGTGTTGTTGTGGGGTG 

TATGGGTATTATTATGGGGGG 

GGGTATGGGTATTATTATGGG 

GGGGGGTATGGGTATTATTAT 

TATGGGGGGTATGGGTATTAT 

TATTATGGGGGGTATGGGTAT 

TATTATTATGGGGGGTATGGG 

GGGTATTATTATGGGGGGTAT 

TGTGGGTGTTGTTGTGGGGGG 

GGGTGTGGGTGTTGTTGTGGG 

GGGGGGTGTGGGTGTTGTTGT 

TGTGGGGGGTGTGGGTGTTGT 

TGTTGTGGGGGGTGTGGGTGT 

TGTTGTTGTGGGGGGTGTGGG 

GGGTGTTGTTGTGGGGGGTGT 

CTCGGGCTCTGTTGTTGTGGG 

GGGGTGGGGGTGTGTTGTTGT 

TGTGGGCTGGGGGTGTGTTGT 

TGTTGTGGGGTGGGGGTGTGT 

TGTTGTTGTGGGGTGGGGGTG 

CTCTGTTGTTGTGGGGTGGGG 

GGGCTCTGTTGTTGTGGGCTC 

GGGGGGCTCGGGCTCCTCCTC 

CTCGGGGGGCTCGGGCTCCTC 

CTCCTCGGGGGGCTCGGGCTC 


CCCCCCCCCCCCCCCCCCCCC 
GAGCCCGAGGAGGAGCCCCCC 
CCCGAGCCCGAGACAATAACA 
ACACCCGAGCCCGAGACAACA 
ACAACACCCGAGCCCGAGACA 
ACAACAACACCCGAGCCCGAG 
GAGACAACAACACCCGAGCCC 
CCCGAGACAACAACACCCGAG 
ATACCCATAATAATACCCCCC 
CCCATACCCATAATAATACCC 
CCCCCCATACCCATAATAATA 
ATACCCCCCATACCCATAATA 
ATAATACCCCCCATACCCATA 
ATAATAATACCCCCCATACCC 
CCCATAATAATACCCCCCATA 
ACACCCACAACAACACCCCCC 
CCCACACCCACAACAACACCC 
CCCCCCACACCCACAACAACA 
ACACCCCCCACACCCACAACA 
ACAACACCCCCCAGACCCACA 
ACAACAACACCCCCCACACCC 
CCCACAACAACACCCCCCACA 
GAGCCCGAGACAACAACACCC 
CCCGAGCCCGAGACAACAACA 
ACACCCGAGCCCGAGACAACA 
ACAACACCCGAGCCCGAGACA 
ACAACAACACCCGAGCCCGAG 
GAGACAACAACACCCGAGCCC 
CCCGAGACAACAACACCCGAG 
CCCCCCGAGCCCGAGGAGGAG 
GAGCCCCCCGAGCCCGAGGAG 
GAGGAGCCCCCCGAGCCCGAG 
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2.4. Binary Image of DNA Codes. In this Section we will define a Gray 
map which allows us to translate the properties of the suitable DNA codes 
for DNA computing to the binary cases. The definition of the Gray map ip 
from R to F 2 , for each element of F 2 is expressed as 

ip{ao + aiu^ + a 2 U^ + + a^u^) = (uq, ui, 02, 03, 04, 05), 

where ai € F 2 . We define Lee weight over the ring R by 

1=5 

WLee{ao + aiu^ + a2u‘^ + + a^u'^ + osu®) = ^ Oj. 

i=0 

The Lee distance dL{x, y) between the vector x and y is WLee{x — y)- Accord¬ 
ing to the definition of the Gray map, it is easy to verify that the image of 
a linear code over ii by <y9 is a binary linear code. We can obtain the binary 
image of the DNA code by the map ip and the map (j). In Table 4 we give the 
binary image of the codons. The binary image of DNA code resolved the 
problem of the construction of DNA codes with some properties, see [12] . 

Table 4. Binary Image of the Godons 


GGG 

000000 

GGG 

mill 

TAT 

000001 

ATA 

111110 

GGA 

011111 

GGT 

100000 

TAG 

100001 

ATG 

011110 

GGG 

101111 

GGG 

010000 

TAA 

010001 

ATT 

101110 

GGT 

001111 

GGA 

110000 

TAG 

110001 

ATG 

001110 

AGG 

110111 

TGG 

001000 

CAT 

001001 

GTA 

110110 

AGA 

010111 

TGT 

101000 

GAG 

011001 

GTG 

100110 

AGG 

loom 

TGG 

011000 

CAA 

011001 

GTT 

100110 

AGT 

000111 

TGA 

111000 

GAG 

111001 

GTG 

000110 

GGG 

moil 

GGG 

000100 

AAT 

000101 

TTA 

111010 

GGA 

011011 

GGT 

100100 

AAC 

100101 

TTG 

011010 

GGG 

101011 

GGG 

010100 

AAA 

010101 

TTT 

101010 

GGT 

001011 

GCA 

110100 

AGG 

110101 

TGG 

001010 

TGG 

110011 

ACC 

001100 

GAT 

001101 

GTA 

110010 

TGA 

010011 

ACT 

101100 

GAG 

101101 

GTG 

010010 

TGG 

100011 

AGG 

011100 

GAA 

011101 

GTT 

100010 

TGT 

000011 

AGA 

111100 

GAG 

111101 

GTG 

000010 


The following property of the binary image of the DNA codes comes from 
the definition. 

Lemma 2.16. The Gray map ip is a linear distance preserving 
(R^, Lee distance)^ (¥ 2 ^^^, Hamming distanee). 
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Further if C is a cyclic DNA code over R then (p{C) is a binary quasi-cyclic 
DNA code of the length 6n and of index 6. 

Proof. Let C be a cyclic DNA code of length n over R. Hence is a 
set of length 6n over the alphabet F 2 which is a quasi-cyclic code of index 
6. It is easy to verify that the Gray map is a linear distance preserving. □ 

3. Skew Cyclic DNA Codes over .R = F 2 -I- i;F2 = {0, l,v,v + 1} 

3.1. Notation and Prelimenaries. The ring considered in this section is 
the non-commutative ring R\x]9]. The structure of this non-commutative 
ring depends on the element of the commutative ring R = {0, l,u,u -1- 1}, 
where = v and the automorphism 6 on R, dehned by 0(0) = 0, 0(1) = 1, 
9{v) = v + 1, 9{v + l) = V. Note that 0^(a) = 9{9{a)) = a for all a £ R. This 
implies that 0 is a ring automorphism of order 2. The skew polynomial ring 
R[x; 9] is the set of polynomials R[x] 0] = {oq -|- aix -|- a 2 X^ anX^\ai}, 

where addition is the usual polynomial addition and the multiplication 
is not commutative over R, denote by * and dehned by the basic rule 
(ax*) * {hx^) = a0*(6)x*+-^ and the distributive and the associative laws. 

We have one-to-one map V’ between the elements of R and the DNA nu¬ 
cleotide base {A, T, C, G} given hyO^G,v^C,v + l^T and 1 —> A. 
A simple verihcation give that for all x £ R, we have 

(4) 0(x) -|- 0(x) = V + 1. 

In the following, we only consider codes with even length. 

Definition 3.1. Let R = F 2 -|- XF 2 = {0, l,v,v + 1} he a ring where = v 
and the automorphism 9 defined previously. A subset C of R” is called a 
skew cyclic code (9-cyclic code) of the length n if 

(1) C is a R-suhmodule of R” and 

(2) if C= (co,Ci,--- ,Cn-l) £ C then (0(Cn-l), 0(co), • • • ,0(Cn-2)) G C 

The ring R„ = R[x;0]/(x"' — 1) denotes the quotient ring of R[x;0] by 
the (left) ideal (x** — 1). Let /(x) G R„ and r(x) £ R[x;0], we dehne the 
multiplication from left as: 

(5) r(x) * (/(x) -b (x** — 1)) = r(x) * /(x) -b (x** — 1) 
for any r(x) £ R[x; 0]. Dehne a map as follows 

e : R” ^ R[x;0]/(x** - 1) 

(co,Ci, • • • ,Cn-l) Co -b CiX -b C 2 X^ H-b Cn-ix"'”^ 
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It is clear that ^ is an i?-module isomorphism map. This implies that each 
element (cq + ci • • • + Cn-i) G can be identified by the polynomial c{x) = 
Co + CiX + C2X^ H-h G Rn- 

Lemma 3.2. ( |Il Lemma 1]) If n is even, and re” — 1 = g{x) * fix) in 
R[x; 6], then x” — 1 = g{x) * fix) = fix) * gix) 

The following proposition gives any structures of the skew cyclic codes 
over Rn- 

Proposition 3.3. ( JH Corollary 3]) Let C be a skew cyclic code in Rn- 
Then 

(1) If a polynomial gix) of least degree in C is a monic then C = (^(x)), 
where gix) is (skew) right divisor of x"^ — 1. 

(2) If C contains some monic polynomial hut no polynomials fix) least 
degree in C is monic. Then C = (/(x), 5 '(x)), where gix) is a monic 
polynomial of least degree in C and fix) = u/i(x) or fix) = (n + 
l)/i(x) for some polynomial binary /i(x). 

(3) If C does not contain any monic polynomials. Then C = (/(x)) 
where fix) = n/i(x) or fix) = (n + l)/i(x) and /i(x) is a binary 
polynomial that divides x"' — 1. 

Now, we are interesting in constructions of [n, dj-skew cyclic DNA codes. 
To this end, we start by defining such codes. 

Definition 3.4. Let 1 < d < n — 1 be a positive real number. A skew cyclic 
code C over R is said to be a [n,d]-skew cyclic DNA code if the following 
conditions hold. 

(1) C is a skew cyclic code, that is, C is a R-submodule of Rn 

(2) for any codeword X G C: ^ (A) and (A)'’^ G (C) 

(3) duiX, Y)<d for any X,Y eC. 

3.2. The Reverse-Complement Skew Cyclic Codes over R. In this 
subsection, we give conditions on the existence of the reverse-complement 
cyclic codes of the even length n over the ring R. 

Let V = (ao, ai, • • ■ , o„_ 2 , On-i) be a vector in Rn, the reverse of the vector 
n is = (an-i,an- 2 , • • • ,ai,ao)- Let /(x) is the polynomial correspond of 
the vectors such that /(x) = ao+aix-|-- • •+a„,-ix"'“^, we find the polynomial 
correspond of the vector in i?[x; 0], we multiply the right of the polynomial 
/(x“^) by x”"^, we obtain: /(x“^)x"'“^ = aox""”^ + aid(l)x"'“^ + ■■■ + 
an- 2 ^”“^(l)a: + an9^~^il) = On-i + an- 2 X + • • • + aix"-”^ -I- aox"-”^. This 
polynomial corresponds of the vector denoted by /'’(x) (see Section 1). 
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Definition 3.5. Let f{x)* = f{x~^) * be the reciprocal polynomial 

of a given f{x) in R[x]9]. The polynomial f is called self-reciprocal if f 
coincides with f*. 

Example 3.6. Let f be a polynomial in R[x', 9] such that, 

f{x) = x^-Lvx"^-\-{v-\-l)x-\-v, this polynomial represented the DNA sequence 

X(ACTC), we find the reverse of the sequence X by f*{x). 

f*{x) = f{x~^)x^ = 10^(1) + v9‘^{l)x + (f + l)9{l)x‘^ + v9^{l)x^ = vx'^ + 

{v + l)x^ + vx + 1. Then we have the reverse of the DNA sequence of X is 
(CTCA). 

Notice that the definition of reciprocal polynomial over R[x, 9\ is being 
different from the one defined over a commutative ring. Indeed, in the 
non-commutative ring R[x,9], we use the right multiplication over the au¬ 
tomorphism 9 and the multiplication over R[x,9]. 

Lemma 3.7. Let f{x) and g{x) be polynomials in R[x,9] with deg{f{x)) > 
deg{g{x)). Then the following assertions hold: 

(i) [fix)g{x)]* = f{x)*g{x)* 

(ii) [/(x) -|- g{x)]* = f{x)* -|- g{x)x‘^’^3U)-deg{g) 

Proof. Assertion {i) assume that /(x) = g{x) = bjX^, 

such that the deg{f) > deg{g), we have f{x)g{x) = YJktl Yli=o ai9\bk-i)x^, 
by Definition l3.2[ we have that {f{x)g{x))* = o.i9'^{hk-i)x~^)x'^~^'P. 

Then we have {f{x)g{x))* = Ylt^oYli=oC'i^^ibk-i)x"'~^^~’‘. By the Defini- 
tion[32]/(a;)* = Er=o = Er=o and 5 (x)* = 

Y7j=o bjxP~T Consequently we have f{x)*g{x)* = Ylltl Ei=o ai9\bk-i)x^^P~^. 
Hence the result follows. 

Assertion (m), by Definition 13.21 we have 

(fix) + g{x))* = (/ + g)*{x) = ((/ -h g)ix-^))x^^3if) 

= if*ix) + g{x-^x^^9{f)) = f*(^x) + g(^x-^)x‘i^9{9)xd^9if)-deg{g) 

= /*(x) -|-. The result follows. 

□ 

In the following we are interested in providing necessary conditions for C 
to be a reverse-complement code. 

Theorem 3.8. Let C = (/(x)) be a skew cyclic code in Rn, where /(x) is 
monic polynomial of minimal degree. If C is reverse-complement then the 
polynomial /(x) is self-reciprocal and u(x” — l)/(x — 1) G C. 

Proof. Let C = (/(x)) be a skew cyclic code over R, where /(x) is monic 
polynomial of minimal degree in C. We know that (0,0, • • •, 0) G C, since C is 
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reverse-complement then(0, 0, • • G C i,e.; (0, 0, • • •, 0) = (f , v,---,v) G C, 
this vector correspond of the polynomial v + vx + ■ ■ ■ + vx‘^~^ = v{x"' — 
l)/(x — 1) G C. We have that f{x) is monic polynomial of minimal degree in 
C, where f{x) = 1 -|- aix -\- ■ ■ ■ + x^, the vector correspond to the polynomial 
f{x) is (1, ai, • • •, 0,0, • • •, 0), since C is reverse-complement and linear, then 
(l,ai,---,0,0,---,0)-GC, i.e., 

/’'c(x) = v+vx-\ - Vvx'^~^~‘^+{v+l)x'^~^~^ +at-ix’^~^ -\ - Vaix'^~‘^+vx’^~^ 

= /’’‘^(x) -|- x(x” — l)/(x — 1) G C. This implies that 
j.n-t-1 _|_ -I- • • • -I- (ai -I- v)x^~‘^ + G C. 

Multiplying on the right by we obtain, 

(1 -|- {at-i + v)9{\)x -I- • • • -I- (oi -|- u)0*“^(l)x*“^ -|- 0*(l)x*)x*“"^“^ G C. 
Hence, (1 -|- (a^-i -|- v)x + ■ —h (ai -|- v)x^~^ + x*) G C, which implies (thanks 
to Equation ( H])) that /*(x) = 1 -|- at-ix -!-•••• -|-x* G C. Since C = (/(x)), 
there exists q{x) G R[x,9] such that /*(x) = q{x)f{x), one necessary have 
q{x) = 1, that is /*(x) = /(x). 


□ 

Theorem 3.9. Let C = (u/i(x)) be a skew cyclic code in Rn, where /i(x) 
is a monic binary polynomial of lowest degree with /i(x)|(x” — 1). If C is a 
reverse-complement code then /i(x) is self-reciprocal. 

Proof. Suppose /i(x) = l-|-aix-|-a 2 X-| - \-x^ is a binary polynomial, the 

vector correspond of/(x) is u = (1, oi, •••, 0 ^- 1 ,1,0, 0,0, •••, 0, 0). Hence = 
(0, 0, 0, • • •, 0,1, dr-i, ■ ■ •, hi, 1). This vectors correspond of the polynomial 
/[‘^(x) = U -|- UX vx^~'^~‘^ -I- (u -|- -|- an-rX'^~'^ -|- • • • -|- 

aix^~‘^ -|- (u -|- = fl‘^ + v{x^ — l)(x — 1). Since C is a linear code then, 

-I- u(x" — l)(x — 1) G C. Therefore 

j.n-r-1 _|_ (hi -|- u)x"“^ -|- x"""^ G C, we multiply by 

we obtain 

1 + (h^_i + v)9{l)x^ + • • • + (hi + u)0'-i(l)x’'-i + 10’'(l)x’' G C. Then 
1 -|- (hr-_i -|- v)x^ (hi -|- v)x'^~^ -|- x'’ G C. By Equation ( |4]), we obtain 

fi{x) = 1 -|- Or-ix^ aix'’“^ -|- x*” G (5, hence vfifx) = 1 -|- v{ar-ix^ -|- 

• • • aix'’“^ -|- x'’) G C by the Corollary 13.31 we have vfl{x) = vfi{x)q{x), 

one necessary have q{x) = 1. Then /i(x) = /i(x). □ 

Theorem 3.10. Let C = {f{x),g{x)) be a skew cyclic code in Rn, where 
f{x) is a polynomial of minimal degree in C and is not a monic polynomial, 
g{x) is a polynomial of least degree among the monic polynomials in C. If C 
is a reverse-complement code then f{x) and g{x) are self-reciprocal. 
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Proof. The proof is similar to the proof of the Theorem 13.81 and of 
Theorem 13.91 □ 

In the following, we provide sufficient conditions for C being reverse- 
complement. 

Theorem 3.11. Let C = (/(x)) be a skew cyclic codes in Rn, where f{x) is 
monic polynomial of the degree minimal in C. If v{x^ — l)/(x — 1) € C and 
f{x) is self-reciprocal then C is reverse-complement. 

Proof. Let f{x) = 1 -t- aix + a 2 x‘^ + ■ ■ • + + x*" be a monic 

polynomial of the degree minimal in C and c(x) € C, we say that c(x) = 
q{x)f{x) where q{x) G R[x,6]. 

c(x)* = (g(x)/(x))*, by the Lemma ITTI we have c(x)* = q{x)*f{x)*, since 
/(x) is self-reciprocal then c(x)* = q{x)* f{x) G C for all c(x) G C. We know 
that 

(6) V vx nx"“^ G C 

Now, let c(x) = C 0 -I-C 1 X-I-C 2 X'''—hctx*, we multiply the right polynomial c(x) 

by we obtain c(x)*x"^“*“^ = cq-|-C i 0 (l)x-|-C 20 ^(l)x^H-|-Cf0*(l)x*, 

then 

(7) c(x) * = cox^-*-i + cix^-^ + • • • + ctx^-^ G C 

Combining ([6]) and ([71) we obtain 

(8) {v + vx-\ -h (co -h H- \-{ct-\- n)x”“^) G C, 

leading to the following equality (using Equation 0]) 

V + VX + --- + i;x”-*-2 + Cox”-*-l + ClX^^-* + • • • + + Ctx^-^ = (c(x))’’T 

Therefore, (c*(x)'’‘^)* = c(x)’’‘= G C. □ 

Theorem 3.12. Let C = {vfi{x)) be a skew cyclic code in Rn, where /i(x) 
is a monic binary polynomial of lowest degree with /i(x)|(x” — 1). //n(x"’ — 
l)/(x — 1) G C and /i(x) is self-reciprocal then C is reverse-complement. 

Proof. The proof is similar to the proof of Theorem 13.111 □ 

Theorem 3.13. Let C = {f{x),g{x)) be a skew cyclic codes in Rn, where 
f{x) is a polynomial of degree minimal in C and is not monic polynomial, 
g{x) is a polynomial of least degree among monic polynomial inC. If v{x^ — 
l)/(x — 1) G C and f{x) and g{x) are self-reciprocal then C is reverse- 
complement. 


Proof. The proof is similar to the one of Theorem 13.111 


□ 
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3.3. Binary Image of Skew DNA Cyclic Codes. We now recall the 
definition of the Gray map (/? for F 2 + i;F 2 to F 2 , for each element of F 2 
expressed as a + vb, where a,b G F 2 . ^p{a + vb) = {a + b, a) i.e; 0 —(0,0), 

1 ^ ( 1 ) 1 )) '^ + 1 ( 0 , 1 ), V ( 1 , 0 ). 

According to the definition of the Gray map it is easy to verify that (p is 
linear. 

We can obtained the binary image of the DNA code by the map p and 
the map tp, the DNA alphabet onto the set of the length 2 binary word given 
as following G —(0,0), A —>■ (1,1), T (0,1), C —>■ (1,0). We have the 
following property of the binary image of a DNA skew cyclic code. 

Corollary 3.14. The map R 1 —F 2 ” is distance preserving linear isometry, 
thus if C is a skew cyclic DNA code over R then p(C) is a skew quasi-cyclic 
DNA code of the length 2n and of the index 2. 

Proof. The proof is similar to the proof of Lemma 12.41 □ 

Table 5. [10,6,2] reverse-complement Skew Cyclic DNA Code 


GGGGGGGGGG 

GGGGCCCCCG 

GGGTTTTTGG 

GGGTAAAACG 

GGCCCCCGGG 

GGCCGGGCCG 

GGCAAAATCG 

GGCAAAATGG 

GTAAAACGGG 

GTAACCATTG 

GTACGGTACG 

GTACCCATGG 

GATTTTGGGG 

GTTTAACCCG 

GTTGCCAACG 

GTTGGGTTGG 


CCCCCCCCCC 

CCCCGGGGGC 

CCCAAAAACC 

CCCATTTTGC 

CCGGGGGCCC 

CCGGCCCGGC 

CCGTTTTAGC 

CCGTTTTACC 

CATTTTGCCC 

CATTGGTAAC 

CATGCCATGC 

CTAGGGTACC 

CTAAAACCCC 

CAAATTGGGC 

CAACGGTTGC 

CAACCCAACC 


CCCCCGGGGG 

CCCCGCCCCG 

CCCATAAACG 

CCGGGCCGGG 

CCGGCGGCCG 

CCGTATTACG 

CCGTTAATTG 

CATTAACGGG 

CAGTATGCCG 

CAAGCGTACG 

CAATTACCGG 

CAAAATGGGG 

CAAATACCCG 

CAACGCAACG 

CAACCGTTGG 

CCACGCAACG 


GGGGGCCCCC 

GGGGGGGGGG 

GGGTATTTGC 

GGCCCGGCCC 

GGCCGCCGGC 

GGCATAATGC 

GGCAATTAAC 

GTAATTGCCC 

GTCATACGGC 

GTTCGCATGC 

GTTAATGGCC 

GTTTTACCCC 

GTTTATGGGC 

GTTGGGTTGG 

GTTGGCAACC 

GGTGCGTTGC 
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